skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "White, Brandon"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available June 19, 2026
  2. Free, publicly-accessible full text available June 2, 2026
  3. Background: Datasets on rare diseases, like pediatric acute myeloid leukemia (AML) and acute lymphoblastic leukemia (ALL), have small sample sizes that hinder machine learning (ML). The objective was to develop an interpretable ML framework to elucidate actionable insights from small tabular rare disease datasets. Methods: The comprehensive framework employed optimized data imputation and sampling, supervised and unsupervised learning, and literature-based discovery (LBD). The framework was deployed to assess treatment-related infection in pediatric AML and ALL. Results: An interpretable decision tree classified the risk of infection as either “high risk” or “low risk” in pediatric ALL (n = 580) and AML (n = 132) with accuracy of ∼79%. Interpretable regression models predicted the discrete number of developed infections with a mean absolute error (MAE) of 2.26 for bacterial infections and an MAE of 1.29 for viral infections. Features that best explained the development of infection were the chemotherapy regimen, cancer cells in the central nervous system at initial diagnosis, chemotherapy course, leukemia type, Down syndrome, race, and National Cancer Institute risk classification. Finally, SemNet 2.0, an open-source LBD software that links relationships from 33+ million PubMed articles, identified additional features for the prediction of infection, like glucose, iron, neutropenia-reducing growth factors, and systemic lupus erythematosus (SLE). Conclusions: The developed ML framework enabled state-of-the-art, interpretable predictions using rare disease tabular datasets. ML model performance baselines were successfully produced to predict infection in pediatric AML and ALL. 
    more » « less
  4. null (Ed.)
    Link prediction in artificial intelligence is used to identify missing links or derive future relationships that can occur in complex networks. A link prediction model was developed using the complex heterogeneous biomedical knowledge graph, SemNet, to predict missing links in biomedical literature for drug discovery. A web application visualized knowledge graph embeddings and link prediction results using TransE, CompleX, and RotatE based methods. The link prediction model achieved up to 0.44 hits@10 on the entity prediction tasks. The recent outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), also known as COVID-19, served as a case study to demonstrate the efficacy of link prediction modeling for drug discovery. The link prediction algorithm guided identification and ranking of repurposed drug candidates for SARS-CoV-2 primarily by text mining biomedical literature from previous coronaviruses, including SARS and middle east respiratory syndrome (MERS). Repurposed drugs included potential primary SARS-CoV-2 treatment, adjunctive therapies, or therapeutics to treat side effects. The link prediction accuracy for nodes ranked highly for SARS coronavirus was 0.875 as calculated by human in the loop validation on existing COVID-19 specific data sets. Drug classes predicted as highly ranked include anti-inflammatory, nucleoside analogs, protease inhibitors, antimalarials, envelope proteins, and glycoproteins. Examples of highly ranked predicted links to SARS-CoV-2: human leukocyte interferon, recombinant interferon-gamma, cyclosporine, antiviral therapy, zidovudine, chloroquine, vaccination, methotrexate, artemisinin, alkaloids, glycyrrhizic acid, quinine, flavonoids, amprenavir, suramin, complement system proteins, fluoroquinolones, bone marrow transplantation, albuterol, ciprofloxacin, quinolone antibacterial agents, and hydroxymethylglutaryl-CoA reductase inhibitors. Approximately 40% of identified drugs were not previously connected to SARS, such as edetic acid or biotin. In summary, link prediction can effectively suggest repurposed drugs for emergent diseases. 
    more » « less
  5. null (Ed.)
    Tyrosine kinase inhibitors (TKIs) are the frontline therapy for BCR-ABL (Ph+) chronic myeloid leukemia (CML). A systematic meta-analysis of 43 peer-reviewed studies with 10,769 CML patients compared the incidence of gastrointestinal adverse events (GI AEs) in a large heterogeneous CML population as a function of TKI type. Incidence and severity of nausea, vomiting, and diarrhea were assessed for imatinib, dasatinib, bosutinib, and nilotinib. Examination of combined TKI average GI AE incidence found diarrhea most prevalent (22.5%), followed by nausea (20.6%), and vomiting (12.9%). Other TKI GI AEs included constipation (9.2%), abdominal pain (7.6%), gastrointestinal hemorrhage (3.5%), and pancreatitis (2.2%). Mean GI AE incidence was significantly different between TKIs (p < 0.001): bosutinib (52.9%), imatinib (24.2%), dasatinib (20.4%), and nilotinib (9.1%). Diarrhea was the most prevalent GI AE with bosutinib (79.2%) and dasatinib (28.1%), whereas nausea was most prevalent with imatinib (33.0%) and nilotinib (13.2%). Incidence of grade 3 or 4 severe GI AEs was ≤3% except severe diarrhea with bosutinib (9.5%). Unsupervised clustering revealed treatment efficacy measured by the complete cytogenetic response, major molecular response, and overall survival is driven most by disease severity, not TKI type. For patients with chronic phase CML without resistance, optimal TKI selection should consider TKI AE profile, comorbidities, and lifestyle. 
    more » « less